Discover Relevant Environment Feature Using Concurrent Reinforcement Learning

نویسندگان

Zhihui Luo

David A. Bell

Barry McCollum

چکیده

In order to compare the policies more efficiently, we introduce a new reinforcement learning method called concurrent biased learning. This is a multi-thread learning method, in which each learning thread refers to one feature of the environment. If an agent intentionally focuses on part of these environmental features to learn a policy of a task, we call this method a biased learning; otherwise, if an agent uses all features that it perceives to learn a task, we call this unbiased learning. We present a method concerning the relevance of information in order to improve the learning of a reinforcement learning robot. We introduce a new concurrent online learning algorithm to calculate the contribution C(s) and relevance degree I(s) to quantify the relevancy of features with respect to a desired learning task. Our analysis shows that the correlation relationship of the environment features can be extracted and projected to concurrent learning threads. By comparing the contribution of these learning threads, we can evaluate the relevance degree of a feature when performing a particular learning task. ( ) i s π If the agent learns a policy with respect to one of the environmental features , we call this biased learning with respect to feature i . The biased Q-value can be denoted as . Then the Bellman update function of the biased Q-value is: i

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Running head : AGING , REINFORCEMENT LEARNING AND ATTENTION 1 The effects of aging on the interaction between reinforcement learning and attention

Reinforcement learning (RL) in complex environments relies on selective attention to uncover those aspects of the environment that are most predictive of reward. While previous work has focused on age-related changes in RL, it is not known whether older adults learn differently from younger adults when selective attention is required. In two experiments, we examined how aging impacts on the int...

متن کامل

The effects of aging on the interaction between reinforcement learning and attention.

Reinforcement learning (RL) in complex environments relies on selective attention to uncover those aspects of the environment that are most predictive of reward. Whereas previous work has focused on age-related changes in RL, it is not known whether older adults learn differently from younger adults when selective attention is required. In 2 experiments, we examined how aging affects the intera...

متن کامل

Learning Visual Feature Spaces for Robotic Manipulation with Deep Spatial Autoencoders

Reinforcement learning provides a powerful and flexible framework for automated acquisition of robotic motion skills. However, applying reinforcement learning requires a sufficiently detailed representation of the state, including the configuration of task-relevant objects. We present an approach that automates state-space construction by learning a state representation directly from camera ima...

متن کامل

A multiagent architecture for concurrent reinforcement learning

In this paper we propose a multiagent architecture for implementing concurrent reinforcement learning, an approach where several agents, sharing the same environment, perceptions and actions, work towards one only objective: learning a single value function. We present encouraging experimental results derived from the initial phase of our research on the combination of concurrent reinforcement ...

متن کامل

RRLUFF: Ranking function based on Reinforcement Learning using User Feedback and Web Document Features

Principal aim of a search engine is to provide the sorted results according to user’s requirements. To achieve this aim, it employs ranking methods to rank the web documents based on their significance and relevance to user query. The novelty of this paper is to provide user feedback-based ranking algorithm using reinforcement learning. The proposed algorithm is called RRLUFF, in which the rank...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2008

Discover Relevant Environment Feature Using Concurrent Reinforcement Learning

نویسندگان

چکیده

منابع مشابه

Running head : AGING , REINFORCEMENT LEARNING AND ATTENTION 1 The effects of aging on the interaction between reinforcement learning and attention

The effects of aging on the interaction between reinforcement learning and attention.

Learning Visual Feature Spaces for Robotic Manipulation with Deep Spatial Autoencoders

A multiagent architecture for concurrent reinforcement learning

RRLUFF: Ranking function based on Reinforcement Learning using User Feedback and Web Document Features

عنوان ژورنال:

اشتراک گذاری